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Abstract 

After proper rescaling and under some technical assumptions, the 
smahest eigenvalue of a sample covariance matrix with aspect ratio 
bounded away from 1 converges to the Tracy-Widom distribution. 
This complements the results on the largest eigenvalue, due to Sosh- 
nikov and Peche. 

Part I 

Introduction 

It has been long conjectured that some of the asymptotic statistical properties 
that are known for eigenvalues of large matrices with Gaussian entries should 
be valid, in particular, for more general random matrices with independent 
entries. This is part of a phenomenon called 'universality' in the physical 
literature; see for example Conjecture 1.2.1, Conjecture 1.2.2, and various 
remarks scattered in Mehta's book [15j. 
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In particular, the local statistics of the eigenvalues at the edge of the 
spectrum should be the same as in the Gaussian case (precise definitions are 
provided below.) 

The first rigorous results of this kind are due to Soshnikov. In [21], he 
established a universality result at the edge for large Hermitian matrices with 
independent entries; we formulate his result as Theorem II. 1.31 below. Univer- 
sality at the edge for Hermitian random matrices with independent entries 
was further studied by Ruzmaikina [18j and Khorunzhiy and Vengerovsky [9]. 

In the subsequent work [22j, Soshnikov extended his method to the largest 
eigenvalues of the sample covariance matrices XX*, under some restrictions 
on the dimensions of the matrices X. These restrictions were later disposed 
of by Peche pTO] ; see Theorem II. 1.21 below. 

In the Hermitian case, the largest and the smallest eigenvalues are iden- 
tically distributed; Soshnikov's result encompasses both the largest and the 
smallest eigenvalue. The state of affairs is different for sample covariance 
matrices, the smallest eigenvalue of which is much smaller in absolute value 
than the largest one. Therefore Soshnikov's approach does not seem to be ap- 
plicable to the smallest eigenvalue of a sample covariance matrix; we discuss 
this further below. 

In this paper, we suggest a different approach, and apply it to prove a 
universality result for the smallest eigenvalue of a sample covariance matrix; 
see Theorem II. 1. II below. We also apply it to give another proof of the results 
of Soshnikov and Peche, Theorems II. 1.31 and II. 1.21 

In the special case of Gaussian matrices, alternative approaches are avail- 
able, and most of the results are known. The asymptotic distribution of the 
extreme eigenvalues of Gaussian Hermitian matrices has been first studied 
by Bronk [1] in the 1960-s, and more recently by Bowick and Brezin, Moore, 
Forrester, and finally by Tracy and Widom, who have established the con- 
clusion of Theorem II. 1.31 in the Gaussian case. Parallel results for Gaussian 
sample covariance matrices have been proved by Johansson, Johnstone, and 
Soshnikov, and Borodin and Forrester. We defer the precise references to 
Section 11.31 

In fact, our argument (as well as those of Soshnikov and Peche) involves 
reduction to the Gaussian case. We discuss this in detail in Section 11.31 
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I.l Formulation of results 



The entries of the random matrices that we shall consider in this paper will 
be (complex- valued) random variables r satisfying the following assumptions: 

(Al) the distribution of r is symmetric (that is, r and — r are identically 
distributed); 

(A2) E|rp^ < (CokY for some constant Co > (r has subgaussian tails.) 
Also, we shall assume that either 

(A3i) Er^ = Erf = 1 (or equivalently, r is real almost surely and Er^ = 1) 
or 

(ASs) Er^ = 0; Erf = 1 (that is, E(3?r)2 = E(5>r)2 = 1/2, E(3fJr ^Jr) = 0.) 
Our main result is 

Theorem I.l.l. Fix (3 G {1,2}. Let {X^^^n be a sequence of M{N) x 
matrices, M{N) < N, such that 

1. \imN-.+ooM{N) = +oo; limsupjv^+^ M(A^)/iV < 1; 

2. {Xuv^ I 1 < w < M{N), I < V < N} are independent and satisfy 
(A1),(A2), and (A3p). 

Let A^^'' he the smallest eigenvalue of B^^^ = X^^^'X^-^^* . Then the random 
variable 

{M{Ny/^ - ArV2) (M(iV)-i/2 _ Ar-i/2)i/3 

converges in distribution to the Tracy-Widom law TWp (cf. Section [77^ as 
iV ^ ocQ. 

Our method also yields new proofs of two known results. The complemen- 
tary result for the largest eigenvalue was proved by Soshnikov [22] (under 
additional restrictions on M{N)) and Peche [16] (in this generality): 

Theorem 1.1.2 (Soshnikov; Peche). Fix (3 G {1,2}. Let {X^^^]n be a 
sequence of M{N) x matrices, M{N) < N, such that 

^M{N) < N, so the denominator is negative. This is not a typo. 
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1. \imN^+^ M{N) = +00; 

2. {xiv^ I 1 < M < M{N), 1 < V < N} are independent and satisfy 
(A1),(A2), and (A3p). 

Let ^^^il^j^-^ be the largest eigenvalue of B^^'' = X^^^X*^^^*. Then the random 
variable 

{M{Ny/^ + ArV2) (M(Ar)-i/2 + x-i/2)i/3 
converges in distribution to the Tracy-Widom law TWp. 

The analogous theorem for Hermitian matrices was also proved by Soshnikov 
[2T] . and was the first universality result at the edge of the spectrum for 
matrices with independent entries. It was further studied by Ruzmaikina |T8] , 
and Khorunzhiy and Vengerovsky p]. 

Theorem 1.1.3 (Soshnikov). Fix [3 E {1,2}. Let {A^^^jy be a sequence of 
Hermitian NxN matrices such that {aH^^ | 1 < w < f < N} are independent 
and satisfy (A1),(A2), and, for u < v, (ASp). Let 

be the eigenvalues of A^^\ Then the random variables 

converge in distribution to the Tracy-Widom law TWp. 

Most of this paper is devoted to the proofs of Theorems II.l.im.l."3l In the 
following section (II. 2p . we state slightly more general results in terms of point 
processes. Some of the definitions are postponed to Section II. 4[ There we 
also explain why the formulations of Section [L2] imply those of Section [LTl In 
Section [L5] we formulate two technical statements, and deduce the results of 
Section 11.21 A guide to the subsequent sections, which are mostly devoted to 
the proof of the two technical statements, is provided at the end of Sect ion [L5i 
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1.2 Formulation of results: extended version 

Let us recall the definition of a point process and introduce a (slightly un- 
usual) topology. 

Definition 1.2.1. 

1. A point process ^ on M is a random integer- valued locally finite Borel 
measure on M. 

2. Let ^1,^2, - ■ ■ , ^N, ■ ■ ■ ]C,he point processes on M. We shall write ^ 
if / fd$,N —>■ J fdC, (in distribution) for any bounded / G C(M) such 
that supp / n ]R_ is compact. 

Theorem 1.2.2. Under the assumptions of Theorem \I.l.ll let 

.(TV) .(TV) .(TV) 

be the eigenvalues of B^^^ = X^^^X*^^)*, and let 



Vi 



{M{Ny/^ - iVi/2) (M(X)-V2 _ iV-i/2)V3 • 
Then the point processes 

converge in distribution to the Airy point process 2ti/3.' 

We shall recall the definition of 2ti/3 in Section [L4l 

Theorem 1.2.3 (Soshnikov; Peche). Under the assumptions of Theorem U.l.^ 
let 

.(TV) .(TV) .(TV) 
^1 ^ A2 ^ ■ • • S ^M(N) 

be the eigenvalues of B^^^ = X'^^'^X^-^'^* , and let 



{M{Nyi'^ + Ari/2) (M(X)-V2 + iv-i/2)V3 
Then the point processes 



converge in distribution to the Airy point process 2ti/; 



Theorem 1.2.4 (Soshnikov). Under the assumptions of Theorem \I.1.3[ let 

A(^)<Af)<---<ASv^) 
be the eigenvalues of A^^\ and let 

Then the point processes 
and 

converge in distribution to the Airy point process "Qixp. 



1.3 Some Remarks 

The most important example of random matrices satisfying the assumptions 
of Theorems II.l.HI.1.21 is the Wishart Ensemble: 

Example 1.3.1. 

1. For /5= 1, Xi^y^ ~ Ar(0,l); 

2. For 13 = 2, xiV ~ A^(0, 1/2) + iA^(0, 1/2) (meaning that the real and 
imaginary parts of X^'' are independent Gaussian variables.) 

We denote the random matrix X^^^ by X^^J (suppressing the dependence 
on /3),and set b[Z^ = X^^^ X^Z^*. 

Similarly, the most important example of random matrices satisfying the 
assumptions of Theorem [LL3] is the Gaussian Orthogonal/Unitary Ensemble: 

Example 1.3.2. 

1. (3 = 1: in the Gaussian Orthogonal Ensemble (GOE), 



iV(0,l), u^v 
Ar(0,2), u = v . 
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2. j3 = 2: in the Gaussian Unitary Ensemble (GUE), 



iV(0, 1/2) + zAr(0, 1/2) 
iV(0,l) , 



U ^ V 



U = V . 



We denote the matrix A^^^ defined above by A\^J . 

The main feature of these examples is the invariance property: the distri- 
bution of A[^J , Arw^ is invariant under conjugation by arbitrary orthogonal 
matrices (for /3 = 1) or unitary matrices (for (3 = 2). This feature facilitates 
the study of the eigenvalues of these matrices, and indeed, most of the results 
have been proved much earlier in this special case. 

In particular, the conclusion of Theorem II.2.4I was proved for A^^^ in the 
early 90-s, by Bowick and Brezin, Forrester, Moore, and others, building 
on earlier work by Wigner, Dyson, and Mehta (see [13 [21] and references 
therein.) 

The conclusion of Theorem 11.2.31 was established for the invariant case 
b[^J by Johansson [7] (for (3 = 2) and Johnstone [8] (for (3 = 1); see also 
Soshnikov |22]- The conclusion of Theorem 11.2.21 was proved for b[^^ by 
Borodin and Forrester p5], under the weaker assumption — M{N) —>■ +oo 
(instead of limsup M{N)/N < 1). 

It has been long conjectured that, in the asymptotic limit — > oo, some 
of the statistical properties that were proved for the eigenvalues of matrices 
with Gaussian entries should be valid, in particular, for more general ran- 
dom matrices with independent entries. See for example Conjecture 1.2.1, 
Conjecture 1.2.2, and various remarks scattered in Mehta's book [15]. In 
particular, this should be true for local statistics of the eigenvalues at the 
edge of the spectrum. 

The first rigorous results of this kind are due to Soshnikov. In [21], he 
established Theorem II.2.4[ The main step in his proof is to show that the 
asymptotics of the mixed moments 



does not depend on the distribution of the entries of A^^\ when /? is fixed 
and mi,-- - ,mfc = O^N"^^^). This reduces Theorem II. 2. 41 to the invariant 



EtrA(^)'"^--trA(^) 



(1.3.1) 
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In the subsequent work [22], Soshnikov applied a similar method to the 
largest eigenvalues of the sample covariance matrices B^^\ and proved The- 
orems [LL2|[L2]3l under some additional restrictions on M{N). These restric- 
tions were later disposed of by Peche \16\ . 

This method does not seem to be directly applicable to the smallest eigen- 
value ofB^^K since the asymptotics of fll.3.ip does not depend on the eigen- 
values that are small in absolute value. In this paper, we make use of a 
modified technique, using traces of certain orthogonal polynomials of A^^\ 
B^^\ This technique is based on an idea going back to Bai and Yin [2], which 
was developed in several subsequent works; see [20] and references therein. 



1.4 More definitions 

For the convenience of the reader, we provide some definitions; this section 
is copied, up to change of notation, from the work of Soshnikov [21j . 

Definition 1.4.1. The measure pk = Pk,^ = E^®*^ on M'^ is called the k-point 
correlation measure of a point process ^. 

Remark 1.4.2. Thus defined, the correlation measures have singular com- 
ponents on the diagonals {xi = X2}, et cet. It is common to modify the 
definition to annihilate these singular components. However, the modified 
correlation measures pk o^^e uniquely determined by pk, and vice versa; thus 
the difference is not very essential, and we find it more convenient to work 
with Pk as above. 

Remark 1.4.3. In general, a point process is not uniquely defined by its 
correlation measures. However, a sufficient condition due to Lenard 
ensures uniqueness for the processes that we encounter in this paper. 

For the sequel, let us introduce a topology on measures: 

Definition 1.4.4. Let {pn} be a sequence of measures on M^. We shall write 
Pn ^ p a f fdpn — > / fdp for any bounded continuous function / on M} 
such that supp / fl is compact. 

Definition 1.4.5. The Airy function Ai is (uniquely) defined by 

Ai"(x) = X Ai(x) , Ai(x) ~ — , exp f — x^^^ ) , x +00 . 

2w7rx'-''^ V 3 / 
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Definition 1.4.6. 

1. The Airy point process 2li2 is the (unique) point process such that, for 
every k and any compact set 

T C {{xi,- ■ ■ ,Xk)\xi < ■ ■ ■ < Xk} , 

the restriction p^It is absolutely continuous with respect to the Lebesgue 
measure, and 



dxi ■ ■ ■ aXk ^ ' i<i,j<k 

Ai{x) Ai'{x') - Ai'{x) Ai{x') 



dxi ■ ■ • dXk V / l<i,j<k 

where 

K(x, x' 



X — x' 

2. The Tracy-Widom law TW2 is defined by its cumulative distribution 
function 

r I'+co 

F2{x) — exp < — / (s — x)q^{s)ds , 



where q{-) is the solution to the IP^ Painleve equation: 

q"{s)^sq{s) + 2q{s)' , 

such that 

q{s) ~ Ai(s) , s — > +00 
(the so-called Hastings-McLeod solution.) 

For P = 1, the density of pk can be expressed as the square root of the 
determinant of a 2/c x 2k block matrix, which is composed of 2 x 2 blocks. 
Denote 

DK{x,x') = -1-K{x,x') , 

r+oo 2 

JK{x,x') = — K{x",x')dx" sign (a; — a;') ; 

Jx 2 

then let 

K{x,x') DK{x,x') 



Ki{x,x') 



JK{x,x') K{x,x') 
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Definition 1.4.7. 

1. The Airy point process 2lii is tlie (unique) point process such that, for 
every k and any compact set 

T C {{xi,- ■ ■ ,Xk)\xi < ■ ■ ■ < Xk} , 

the restriction p^It is absolutely continuous with respect to the Lebesgue 
measure, and 



dpklrixi, ■■■ ,Xk) 



det (^Ki(xi, Xj] 



dxi ■ ■ ■ dxk V ^ ^ i<j.i<fc 

2. The Tracy-Widom law TWi is defined by its cumulative distribution 
function 

Fi{x) = exp i^- J + (s - x)g^(s)] ds| . 

Theorem (Tracy-Widom [MIEH])- For P G {1,2}, the distribution of the 
rightmost atom of 211/? is exactly TWjs . 

The functional that sends a locally finite configuration of points (= lo- 
cally finite integer-valued Borel measure) to its rightmost point (= atom) is 
continuous with respect to the convergence and therefore Theorem II.2.2I 
implies Theorem II.l.ll Theorem II.2.3I implies Theorem II.1.2t Theorem IL2.4I 
implies Theorem II.1.3[ 



1.5 The main technical statements 

Definition 1.5.1. The Chebyshev polynomials of the second kind are defined 
as follows: 

TT ( Q\ sin((ra + 1)^) 

Unicode) = — . (1.5.1) 

sm & 

The following elementary proposition may clarify the connection between 
Un and the spectra of the matrices considered in this paper. We shall not 
use it, and therefore omit the proof (see e.g. [201 §5.1].) 

Proposition 1.5.2. 
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1. The polynomials Un are the orthogonal polynomials with respect to 
Wigner's semicircle measure a^/: 

daw[x) 2 2x1/2 
— — 1 — X ^ 



That is, 

Un{x)Un'{x)daw{x) = 5„ 



2. For < s < 1, the polynomials Vn^s = Un + \fsUn-i are orthogonal 

is] 

with respect to the Marchenko-Pastur measure (j)Jp: 



dx TT (1 + s) + 2yix 

In the next parts of this paper we shall prove the following two statements: 

Theorem 1.5.3. Fix (3 E {1,2}, and let {A^^^} be a sequence of random 
matrices satisfying the assumptions of Theorem \I.1.3[ Fix k > 1, and let 
{{nf^\ ■ ■ ■ ,nl^^)}N be a sequence of k -tuples. 

1. IfY^nf^ = 1 mod 2, 

k 

Entof/„(iv)(AW/(2v/iV^)) = . 

i=l 

2. Suppose ^'^i^'* = 2n^^\ There exists a constant C (depending only 
on Co in (A2)), such that 

k 

EjItrt/^(^)(A(^)/(2v/iV^)) < {Cn^'^^f exp [Cn^''^'^' / N'^'} . 
1=1 

3. If moreover n^^^ =0{N^'^), 

k 

Entrf/„w(A(^V(2v/iV^)) 



i=l 



Entrf/„(.)(AS::V(2v/iV^)) + o(C 



i=l 
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as N ^ +00, where A\^^ is as in Example \1.3.^ and the implicit 
constant in o(- ■ ■ ) may depend on k, Cq, and n/N^^^. 

There are several ways to deduce Theorem II.2.4I from Theorem II. 5. 3[ For 
example, one may use Levitan's uniqueness theorem [13] for the transform 



a\/x 



-djj,{x) 



which appears naturally from the asymptotics of f/„ near ±1. However, the 
justification of convergence makes this approach quite cumbersome. 

We shall follow Soshnikov's original argument |21j and go back to mo- 
ments and to the Laplace transform 



exp{ax)dfi{x) 



Proof of Theorem 1. 2. 4 . We shall use the following simple identities (see e.g. 
Snyder [19]): 



2m 



X 



X 



2m-l 



1 ( 

1 ^ / 2m 

— — - > 2n f/< 

(2m)22'»-i ^ \m-nj 



2m + 1 
m — n 



Uonix) 



2n~l[X 



(1.5.2) 
(1.5.3) 



Let us show that 

1. Etr(A(^V(2v^))" < g7^exp(C2mVAr2), where Ci,C2 may depend 
on Co; 

2. Etr(A(^)/(2yiV))'" = Etr(4^V(2v^))'" + 0(1) for m = 0{N^I^), 
where the implicit constant in o(l) may depend on Cq and on m/N"^/^. 

(This is more or less the content of Theorem 2 in [2T].) Substitute 

X = A(^V(2v/iV^) 

in (II.5.2P and take the expectation of the trace: 

.27^^. 

-j^ m 

2m + 122™^^ 

^ ' n=0 



Etr 



2m + 1 
m — n 



EtiU- 



2n 



AW 



(L5.4) 
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TERMo =1 1 < 



The 0-th term in (11.5.4^ is equal to 

)n< 

By the second item of Theorem II. 5. 31 

TERM, < {2n + 1) + ^\ Cn eMCn""/^ /N^'^) 
\m — n J 

< 9^ expf-cnVm + Crx'/^IN^'^) . 
\/rn 

Thus 

AW 



(1.5.5) 



Etr 



^ C22™ 



1 f " 

— =— <^ iV + V expf-cnVrw + Crv^'^/N^'^ 

\ n=l 

CN 



This proves 1. 

The inequahty 01. 5. 5^ also ensures that the contribution of 

n > C'm^/N + N^l^ 

(with, say, C = 10) is negligible. Hence one can restrict the sum to 

n < C'rr?/N + N^'"^ , 

and apply the third item. This proves 2. for even values of m; for odd values 
of m, both sides are zero. 

Proceeding with Soshnikov's argument, we deduce that the sequences 
{^^^^}, {t]'^^^} are precompact, and that 

(4iV)-'"-/2Etr(A(^))"-^ j eMo^y){{-irdp,^^{y) + dp,M) 

for any limit points r/, as long as mjv/A^^/^ q, mN is of constant 
parity p. Therefore the Laplace transforms ii(pi^g), ii(pi,,y) do not depend on 
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the distribution of the entries of the matrix A^^\ In exactly the same way 
we show that £(pfc^g), £(^^,77) are defined uniquely for any k > 1, and hence 
are the same as for a[^J . Therefore (again, see [21]), we deduce that 



D 



□ 



Theorem 1.5.4. Fix (3 E {1,2}, and let {B^^^} be a sequence of random 
matrices satisfying the assumptions of Theorem Fix k > 1, and let 



{{n[^\ ■ ■ ■ jn]!"')}]^ be a sequence of k -tuples. 

1. Suppose X]^!^'' = n''^\ There exists a constant C (depending only on 
Cq in (A2)), such that 



BW - (M(iV) +N-2) 

nr,MiN)/N 1^2v/(M(iV)-l)(iV-l) 

< (Cn(^))'= exp {Cn^'^y'^^/MiNy/^ 



1=1 



2. If moreover n^^^ = 0{M{Nf/^), 

11 n\ \M{N)/N I 2^{M{N)-l){N -I) 



- (M(iV) + AT - 2) 



i=l 



nr\MiN)/N I 2^{M{N)-1){N 



I] 



+ o{{n 



(iV)^fc^ 



as N ^ +00, where B, 



(N) 



IS as m 



Example I/. 3. 1\ and the implicit 



constant in o(- ■ ■ ) may depend on k, Cq, and n/M(NY^^ . 
Similarly to the above, Theorem II.5.4I implies Theorems II.2.2|I.2.3[ 

Sketch of proof of Theorems I/. 2. 2^1. 2. 31 As in the proof of Theorem lI.2.4t we 
consider moments. Expressing 



Etr 



_ (M(A^) + iV - 2 
2v/(M(Ar) - l)"(ArrT] 



2m 



+ Etr 



_ (M(A^) + - 2) 
2^{M{N)-l){N-l) 



2m-l 



(1.5.7) 



14 



in terms of 



EtrK 



n,M{N)/N 



B(^) - {M{N) +N-2) 
2v/(M(iV)-l)(iV-l) 



one may check that the asjTiiptotics of fll.5.7p is the same as for B^^^ = B- 
If M{N) /N < 1 - r] < I, the same is true for 



(N) 



Etr 



- {M{N) +N-2) 
2^{M{N)-1){N-1) 



n 2m 



Etr 



- (M(iV) +N-2) 
2y/{M{N)-l){N-l) 



2m-l 



(1.5.8) 



(with the imphcit constants depending on i].) From this point, proceed as in 
the proof of Theorem II.2.4[ □ 



Remark 1.5.5. Taking Remarks lII. 3 . SliIV. 1 . 6{ into account, one can actually 
avoid the use of any results for Wishart matrices, and compare the correlation 



measures to those in Theorem 1.2.4 



Plan of the proceeding sections. Parts [IT|IIII are devoted to the proof of 
Theorem II. 5. 31 In Part[TTlwe focus on the special case of matrices the entries 
of which are uniformly distributed on the {(3 — l)-dimensional sphere (except 
for the diagonal entries, which are zero, see (III. 0.11) below.) We discuss the 
asymptotics of the expectations in Theorem II. 5. 31 in detail, first for k = 1, 
and obtain a certain "genus expansion" , Proposition lII.2.5l In Section lII.3l we 
extend these results to arbitrary k > 1. This part is based on the connection 
to non-backtracking paths on the complete graph, which is very explicit and 
simple in the special case f lll.O.ip (see Claim [IT. 1.21 below). 

In Part IIIII we show that the results of Part [TTl can be extended to matrices 
with arbitrary distribution of entries (that satisfy the conditions of Theo- 
rem [LL3l) The three main technical difficulties that appear are: 

1. to express trt/„(A/(2v/iV^)) as a sum over paths; 

2. to show that multiple edges do not contribute to the part of the asymp- 
totics that comes from non-backtracking paths. 

3. to show that paths with backtracking do not contribute to the asymp- 
totics of the expressions in Theorem II. 5. 31 
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In Part IIVI we prove Theorem II.5.4[ The asymptotics of the expressions in 
Theorem lI.5.4l is closely connected to non-backtracking paths on the complete 
bipartite graph. Therefore the proofs mostly mimic the proofs in Parts [IT|IIH 
and we mainly indicate the necessary modifications. 

Part |V] is devoted to extensions and some remarks. We discuss additional 
results that can be proved using the methods of this paper, and indicate the 
modifications that should be made in the proofs. In particular, we discuss 
quaternionic random matrices (which correspond to /5 = 4), and matrices 
with unequal real and imaginary part. In Section [V. 21 we discuss some devi- 
ation inequalities for the extreme eigenvalues. 

Notation: The large parameter in this paper is oo. For quantities 0, il^ 
depending on A^, we write -C for = o{ijj), and (p ^ ip for (p/ip = l-|-o(l); 
= e(V') if = Oiip) and = 0(0). The letters C, C',Ci,--- will stand 
for positive constants the value of which may vary from line to line. Some 
of these may depend on Cq in (A2) or on other parameters; we mention it 
explicitly when this is the case. 



Part II 

Matrices with uniform entries 



From this point, we suppress the dependence on in the notation. 



In this part, we focus on the special cases 




±1 with prob. 1/2, u v , 



u = V \ 



(II.O.l) 



II. 1 Reduction to diagrams 



Consider the following sequence of polynomials P„ = Pn,N'- 
Poix) = 1, Pi(x) = X, P2ix) = x^-{N-l), 
Pn{x) = xPn-i{x) -{N - 2)Pn-2{x) ioi n > 3. 



(II.l.l) 
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Lemma II. 1.1. The following identity holds: 
Pn{x) = {N- 2)"/2 



X 



1 , / X M (II.1.2) 



where formally f/_2 = f^-i = 0. 



Proof. For n = 0,1,2 the identity (111.1.2^ follows directly from ( lll.l.ip 



(iLOl . Next, fITXTll implies (cf. [T9]) that 

Un{y) = 2yUn-i{y)-Un^2{y) , n = 2,3,-- - . (II.1.3) 



Taking y = x/ {2\/N — 2), we see that the right-hand side of (]II.1.2|) satisfies 



the same recurrent relation as the left-hand side. □ 

Claim II. 1.2. For any Hermitian NxN matrix A with zeros on the diagonal 
and other entries on the unit circle, 

Pn{A)uoU„ ^ ^ ^M0Ml^HlM2 ■ ■ ' ^Un-lUn ) (II. 1.4) 

Pn 

where the sum is over all paths Pn = uo'^i ■ ■ ■ such that 

(a) Uj ^ Uj-i for i = 1, ■ ■ ■ ,n; 

(b) Uj 7^ 'Uj_2 for j = 2, ■ ■ ■ ,n (the non-backtracking condition). 
Proof. For = 0, 1 the identity (III. 1.41) is trivial. For n > 2 observe that 

Pn{A) = Pn-i{A)A -{N- 2)P„_2(^) (11.1.5) 
according to flll.l.Sp . and on the other hand 



A A 



1, U V 
0, U = V 



and hence the right-hand side of (III. 1. 40 also satisfies (III. 1.50 . □ 

By Claim [11.1.21 the expectation EtrP„(y4) is equal to the number of 
paths Pn = uqUi ■ ■ - Un that satisfy the conditions (a),(b) (above) and (c),(d^) 
(below) : 
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(c) Un^Uo; 

{d[) for any u v, 

# {j I = ^j+i = ^} = # {j I % = ^> ^j+i = mod 2 ; 
(d2) for any u ^ v, 

# 0' I % = %+i = f^} = # {j I Uj = i;, -Uj+i = u} . 
In particular, EtrP2n+i(^) — 0, therefore we shall only study 

Ej = Ej(2n) =EtrP2n(^) • (11.1.6) 

Let p2n = UQUi---U2n be a path satisfying (a), (b), (c), (d^). Con- 
sider a directed multigraph G = (V,£^dir), where V C {I,-- - , A^} is the 
set of all vertices Uj, and E'dir is the set of edges {uj-i,Uj) (with multiplici- 
ties) . A matching of p2n is a matching (= involution without fixed points) of 
{0, 1,-- - ,2n-l}, so that 

• for /3 = 1, every edge {u, v) is matched either to a coincident edge {u, v) 
or to {v, u); 

• for /9 = 2, an edge {u,v) is matched to {v,u). 

A path together with a matching will be called a matched path. 

Denote by S^™(2n) the number of matched paths (satisfying (a), (b), (c), 
(d^)), and denote by S^(2n) the number of paths satisfying (a), (b), (c) and 
the stronger condition (dp): 

(di) for any u ^ v, 

# {j I = u, Uj+i = -y} + # {j I Uj = V, Uj+i ^u} e {0, 2} . 

(d2) for any u ^ v, 

# {j I «j = Wj+i = ^'l = # {i I = -v, Uj+i = m} e {0, 1} . 
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Obviously, 

E^(2n) < Sj(2n) < Sj'"(2n) . (III. 7) 

Our next goal is to study the asymptotics of S^™'(2n). In particular, we 
shall prove that 

Ej™(2n)<S^(2n)(l + o(l)) 

as long as n = o{N^^'^). 

Let us introduce some more graph-theoretical notation. 

Definition II.1.3. Let (3 G {1,2}. 

• A diagram of type /? is an (undirected) multigraph G = (V, E), together 
with a circuit p = UqUi ■ ■ - Uq on G, such that 

— p is non-backtracking (meaning that no edge is followed by its 
reverse, unless the edge is uu and /5 = 1); 

— For every (m, v) G E, 

# {j I Uj = u, Uj+i = t;} + # {j I Uj = V, Uj+i = u} = 2 (/3 = 1) , 

# {j I Uj = Uj+i = v} = if{j\ Uj = V, Uj+i = u} = 1 {(3 = 2) ; 

— the degree of Mq in G is 1; the degrees of all the other vertices are 
equal to 3. 

• A weighted diagram is a diagram G together with a weight function 
w -.E^ {-1,0,1,2,- ■■}. 

Let us construct a mapping from the collection of matched paths sat- 
isfying (a), (b), (c), (djj) into the collection of weighted diagrams (of type 
p.) 

(i) Start with the multigraph G = G{p2n) = (y,E^ij.) corresponding to the 
path P2n- 

V = {U \ 3j, Uj = U} , Edir = {{Uj,Uj+i)} , 

and unite each pair of matched edges into a single undirected edge. 

(ii) If the degree of uq is greater than 1, add a vertex r connected to uq, and 
replace p2n with tuqUi ■ ■ ■ uqt. Otherwise set r = uq. 

(Hi) For every vertex u ^ r oi degree > 3, replace u with < d — 2 vertices 
of degree < 3 using the inductive procedure illustrated in Figure [H 
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(iv) Erase all the vertices of degree 2. 

(v) Set 




Figure 1: Splitting a vertex of high degree (/? = 1) 
The above construction yields 

Claim II. 1.4. There are at most A^#^+ "'('^) matched paths corresponding 
to a weighted diagram {G,p,w). If w{e) > 1 for every e & E, there are 
exactly 

N{N -1)...{N-{W + Y, ^(e)) + 1) (n.1.8) 

e 

such paths. In particular, ifw{e) > 1 for every e & E, and if 

e 

the number of matched paths and the number of paths (without a matching) 
are both 

II. 2 Counting diagrams 

Let us present an automaton which constructs all possible diagrams. Con- 
sider first the case [3 = 2. 
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States: (t; ii, - ■ ■ , 4), where t,k >0 and ij > 0; initial state: t = k = 0. We 
can visualise the state of the automaton as a "thread" made of t pieces, and 
k "loops" (the j-th loop is made of ij pieces). 

There are 2 transitions for (3 = 2: 

1. ( "creation" of a new loop, see Figure |2]): 

t^t' <t + l; ,4^4,-- - ,4,4+1 , 

where ik+i <t-t' + 2. 




Figure 2: Transition 1. 



2. ( "annihilation" of the j-th loop, see Figure [3]): 

t^t' <t + ij+2 ; 4,--- ,4 ^4,--- ,4 . 




Figure 3: Transition 2. for (3 = 2 



We impose the restriction t > all along the way, and demand that after 
some (even) number of steps s = 2g the automaton return to the original 
state and stop. 

Claim II. 2.1. Every diagram (corresponding to (3 = 2) is generated by the 
automaton. If the automaton stops after s = 2g steps, the diagram has 
jj^E = 6(7 — 1 = 3s — 1 edges and = Ag = 2s vertices. 
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Proof. It suffices to observe that the ffist (creation) step creates 3 edges and 
3 vertices, the last (annihilation) step creates 2 edges and one vertex, and 
every other step creates 3 edges and 2 vertices. □ 

Denote by D2{s) the number of diagrams corresponding to s steps (of course, 
D2{s) = for odd values of s.) 

For (3 = 1, the transitions are slightly different. First, every time a loop is 
annihilated (transition 2.), the automaton has to choose a direction in which 
the loop is passed. That is, there are two possibilities: the one in Figure [3l 
and the one in Figure HI 




Figure 4: Transition 2. (2"^^ possibility) for (3 = 1 



Also, we have a new transition 
3. ("creation and annihilation"): t < — t' < t + 1 (see Figure O) 




Figure 5: Transition 3. for (3 = 1 



Now the number of steps s can be written as s = 2g + h, where g is the 
number of steps of the ffist kind, and h is the number of steps of the third 
kind. Similarly to Claim [IT. 2. 2t we have 

Claim II. 2. 2. Every diagram (corresponding to (3 = 1) is generated by the 
(new) automaton. If the automaton stops after s = 2g + h steps (with g, h 
as above), the diagram has = 6g + 3h — 1 = 3s — 1 edges and = 
4g + 2h = 2s vertices. 
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Denote by -Di(s) the number of diagrams corresponding to s steps (and 
P = l). 

The following crude estimate will be of use: 
Proposition II.2.3. For j3 E {1,2}, 

{s/cy < Df^is) < {CsY , 

where C > 1 is a universal constant. 

Proof. Let us consider for example the case (3 = 2 (the argument for (3 = lis 
similar.) Let s = 2g; the number of loops after j steps is non-negative, and 
zero at the beginning and at the end. Hence the number of ways to order 
the transitions of the two types is exactly the Catalan number 

{2g)\/{g\{g + l)\)<4^ . 

Denote by rrii the number 2 — (increase in t + Yl^j) i-th step. Then 

rrii > and mi + ■ ■ ■ + m2g = 4:g. Therefore the number of ways to choose 
the numbers m,- is at most 



6(7-1 
4g-l 



< (3e 
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The number of diagrams corresponding to a fixed order of transitions and 
fixed rrii is at most [GgY^. This proves the upper bound. 

To prove the lower bound, consider the fixed sequence of transitions 
1.2.1.2. ■■■ 1.2., and rrii = {1 < i < 2g.) It is not hard to check that 
the number of diagrams thus restricted is equal to 



l[{4^-3){2^-l)>{g/Cr^ 



i=l 

□ 

Remark II. 2. 4. Observe that 

Di(l) = l, D2(l) = 0, 1^2(2) = 1 

(see Figure Therefore the upper hound in Proposition \II.27^ can be 

formally improved to D[^{s) < C^'^s^ (perhaps, with a different constant 
C > 0). 



23 





Figure 6: The simplest diagrams: s = P = l, g = 0,h=l (left), s = /3 = 2, 
g = l (right) 

The preceding considerations allow to prove (a more precise form of) Theo- 
rem [L5l3] for the special case (III. 0.11) . k = 1. 

Proposition II. 2. 5. Let (3 G {1,2}, and let the random matrix A he as in 
nLUl\) . Then 

1. Etr[/2„+i {A/{2^/W^)) = 0; 

2. EtrU2n{A/{2y/If^)) < nexp(Cn3/2/A^i/2). 

3. forn = o{y/N), 

Etrf/2„ [A/{2VN^))={l + o{l))nY,{n'/Nr'j^^_ . 

Remark II. 2. 6. For future use, denote 



^,(n;]V) = f^((n/2)ViVr'(^,, 



S>1 



(for any N E ^+-) Then for n <^ N^^"^ 



Etr?7„(A/(2v/iV^)) = (l + o(l))|</>^(n;iV) + (-l)>^(n;iV)| . 

Proof of Proposition \IL2.5[ The first statement is obvious. 
According to flITX6|) . fUTXTl) . 

EP2niA) = Sj(2n) < Sj"^(2n) . 

Every matched path corresponds to some weighted diagram with a certain 
number of steps 1 < s < n. For this diagram, j^V = 2s and j^E = 3s — 1 by 
Claims [ILIJIILMJ Also, 

w{e) = n — jj^E = n — 3s + 1 . 
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The number of ways to place the weights on the diagram is at most 

" + <(n + 3. -2)^^-/(3. -2)!. 

By Claim III. 1.41 the number of ways to choose the vertices is at most 
^2s+n-3s+i ^ ^n-.+i_ gg^^^g Propositlou [HU (and Remark dlMl) 

l<s<n ^ 

- ^ 3s-2 ! 

l<s<n ^ ' 

Thus 

EtS^^ < nexp(C3n/iV + C2n'"^/N^/^) < nexpiC,n'/yN'/^) . 
[N — 2)" 

Hence by Lemma [II. 1.11 
A 



EU. 



2n 



fc>0 



This proves the second statement. 

Fix 1 <^ 77-0 <^ N^^'^. Suppose n <^ N^^"^. If n > no, choose sq so that 



max 



(l,n=^/ViVV2)«,„«^i/2 
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Then by Proposition III. 2. 31 



\3s-2 



E-(2n) < Y: ^/^(^)^""^^'^^^7^rW 

< y: ^/.(^)A^-^^^^^^f^lP (1+0(1)) 



l<s<so 



(II.2.1) 



< Y D,m-^'——-^{i+o{i)) 

1<S<S0 ^ ^' 



^3s 2 



<^D,{s)N-^^^j^—-^_ (1 + 0(1)). 



On the other hand, for every diagram corresponding to a certain s > 1, there 
are 

.3s-2, 

ways to place the weights so that w(e) > 1 for every e G -E; for s ^ n^^"^, 

— 3s\ n^^^"^ 



'is -2) (3s -2)! 



(1+0(1)). 



If the weights are placed in this way, the number of ways to choose the 
vertices is 

N^~'+\l + o(l)) , 

according to the second part of Claim III.1.4[ Every path thus constructed 
satisfies the condition (d^) and hence has a unique matching. Therefore 

Sj-(2n) > S^(2n) > Dp(s)N--^^'-f^^ (l + o(l)) 

\'s. (n.2.2) 

>5:D,(.)iV-^^^^-^ (l+o(l)). 



Ks 



where on the last step we have used Proposition III. 2. 3) again. The inequalities 
(JILMl), fllL2:2|) yield: 



^3s 2 



Sj-(2n) ~ S,(2n) - E ^/^(^)^""^'''7^— W 
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whence by flITX6|) . fUTTTl) 

Etr P2n(A) -Y.ms)N--'^'j^^_ . (11.2.3) 

If n < no, a similar argument shows that 

3/3-2 



and hence ( 1II.2.3P is still true. 



Applying Lemma III. 1.11 as in the proof of the second statement of this 
proposition, we deduce the third statement. 

□ 

II. 3 Product of several traces 

In this section, we shall consider the expectations 

EtrP„,(A) trP„,(A)---trP„,(A) 

for k > 1, which we need in order to study 

Etrf/„,(A/(2v/iV^)) tr f/„,(A/(2v/iV^)) ■ • -tr f/„,(A/(2v/iV^)) . 

Mutatis mutandis, the analysis will be quite similar to the case k = 1, which 
we have considered in the two preceding sections. 
According to Claim III.1.2t 

EtrP„,(A)trP„,(A)---trP„,(A) =S^(ni,--- ,nfc) , 

where Sjj(ni, ■ ■ ■ , n^) is the number of A;-tuples of paths (or shortly: /^-paths) 

Pni,---,nfe = UqUi ■ ■ -U^^^UqUq ■ ■ ■ U^^, ■ ■ ■ , UqU^ ■ ■ ■ 

that satisfy the conditions 
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(a) -u* 7^ M*,.^ ioT i = 1, ■ ■ ■ , k and j = 1, ■ ■ ■ , n^; 

(b) -u* 7^ M*_2 for i = 1, ■ ■ ■ , A; and j = 2, ■ ■ ■ , n^; 

(c) = for i = I,- - ■ ,/c; 
(d^) for any u ^ v, 

^ = j) I ^ = ^, ^+1 = mod 2 , 

^{{i,j)\ui = u,u)^^ = v} 

= j) I u] = = u} , 

As in Section III. 11 we also consider matched fc-paths, that is, /c-paths 
together with a matching (= involution of l±l[L]^{0, 1, ■ ■ ■ , rij — 1} x {i} without 
fixed points) such that 

• for /5 = 1, every edge {u, v) is matched either to a coincident edge {u, v) 
or to {v, u); 

• for /3 = 2, an edge {u, v) is matched to {v, u). 

Denote by S^™(ni, • ■ ■ , n^) the number of matched /c-paths satisfying (a), 
(b), (c), (d^), and by S^(ni,-- - ,nfc) the number of /c-paths satisfying (a), 
(b), (c), and {dp) below: 

(d^) for any u^v, 

+ #{(z, j) I u) = V, = n} G {0, 2} , /3 = 1 ; 

I 4 = = v} 

= I u) = V, u)^, = u}e{0,l} , /3 = 2 . 

Similarly to fUTTTll . 

S^(ni, ■ ■ ■ , rifc) < ■ ■ ■ , n^.) < ■ ■ ■ , rifc) . (II.3.1) 

Next, we extend the definition of a diagram (Definition III. 1.31) in the 
following way: 
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Definition II.3.1. Let P G {1,2}. 



A k-diagram of type /? is an (undirected) multigraph G = (y,E), 
together witli a fc-tuple of circuits 

p = ulul---ul, ulul---ul, u^Ui---Uo 

on G, such that 

— p is non-backtracking (meaning that in every circuit no edge is 
followed by its reverse, unless jS = 1 and the edge is uu); 

— For every (m, v) G E, 

# = u, = v} 

+ #{j|w}=^,w}+i = ^}=2 (/9 = 1), 



u] = u, 



= #{j|w} = t;,M}+i = w} = l {P = 2); 

— the degree of Mq in G is 1; the degrees of all the other vertices are 
equal to 3. 

A weighted k-diagram is a fc-diagram G together with a weight function 
w -.E-. {-1,0,1,2,- ■■}. 



The mapping from the collection of matched fc-paths satisfying the (new) 
conditions (a), (b), (c), (d^) to the collection of weighted fc-diagrams is con- 
structed exactly as for k = 1, and Claim [IT. 1 .41 remains true verbatim. 

To make the automaton from Section III. 21 generate fc-diagrams, we start 
from the same initial state t = k = 0, and demand that the automaton return 
to the same initial state after s steps, and that t = exactly /c + 1 times 
during the procedure. That is, t = after 0, si, si + S2, ■ ■ ■ , si H — ■ Sk steps, 
where Si, ■ ■ ■ , > are some numbers such that Si + ■ ■ ■ + = s. 

Claims III. 2. 21 and III. 2. II take on the following form: 

Claim II. 3. 2. Every k-diagram is generated by the automaton, with the 
new restrictions. If the automaton stops after s = Si + • ■ ■ + steps (with 
Si, • • • , Sfc as above), the k-diagram has ^E = "^{Ssi — 1) = 3s — k edges 
and = ^ 2si = 2s vertices. 
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Denote by D^k^s) the number of fc- diagrams generated in s steps, and 
let Dp^si, ■ ■ ■ , Sfc) be the number of fc-diagrams corresponding to Si, ■ ■ ■ , s^; 
that is, 

Dp,k{s)= ^ Dp{si,--- ,Sk) . 

siH \-Sk=s 

Then Proposition III. 2. 31 can be extended in the following way: 
Proposition II.3.3. For (3 G {1, 2], 

{s/Cr^'-'/{k - 1)! < Dp^kis) < {Csr+'-'/{k - 1)! . 

Now we can extend Proposition III. 2. 51 to all /c > 1, in the following 
(slightly weaker) form: 

Proposition II. 3. 4. Let (3 e {1, 2}, and let k > 1, (ni, ■ ■ ■ , Uk) E N''. Also 
let A be an N X N random matrix as in /III.0.1\) . Then 

1. IfY.ni = l mod 2, 

k 

ETTtrf/^(^,(A(^)/(2v/iV^)) = . 

i=l 

2. IfJ2ni = 2n = mod 2, 

k 

E JJtr[/„^(A/(2VAnr2)) < (Cn)^ exp {Cn^/y N^/^} . 

i=l 

3. If moreover n <^ A^^^^, 

k 

E n tr f/„(^) (A(^)/(2v/iV^)) ~ S^(ni, ■■■ ,nk) 

i=l 

as N ^ +00. 

Proof. As in Proposition III.2.5t the first statement is obvious. 

The weights w{e) satisfy a system of linear equations (depending on the 
diagram) : 

^ w(-u*, -u*.,.!) = nj, i = l,---,A;. (II. 3. 2) 

3 
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For every e G i? and every i, the coefficient Cj(e) of w{e) in the i-th equation 
is 0, 1, or 2, and 

k 

^Q(e) = 2. 
1=1 

As w{e) > -1, 

w{e) < ^ =n + k , 

where the sum is over all the edges except the ffist one in every circuit. 
Therefore 

^{w{e) + 2) <n + k + 2(3s - 2k) = n + 6s - 3k , (II.3.3) 
and the number of ways to place the weights is at most 

< {n + 6s-3kf'-^y{3s-2k)\ 
The number of ways to choose the vertices is at most A^^-^+'^j hence 



n + Qs — 3k 
3s - 2k 



E-fn. ■•• n.X V Nn-s^^ ^^^r'" jn + 6s - 3k) 



3s-2k 



k<s<n 



This proves the second statement. 

The proof of the third statement is similar to the proof of the third statement 
in Proposition III.2.51 Choose 1 <^ rio -C A^^^^. We write 

{!,■■■ ,k} = h\Sl2 , 

where Ii is the set of indices such that nj < (and I2 is its complement). 
The circuits corresponding to z G /i are (typically) trivial; for the circuits 
corresponding to i G I2, we consider the system of equations (111.3.20 and 
prove that most of its solutions satisfy w{e) > 1. This is again similar to the 
proof of Proposition III.2.5j we omit the details. □ 
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Remark II. 3. 5. Extending Remark \II.2.6[ one can write 



k 

i=l 

= (1 + o(l)) Yl (-1)^'^^^"' Mi^ihei; N) MM^0■^ N) , 

IC{l,-,k} 

where now 

(e.g. (f)fs{0,N) = 1, and (f)p{{n}, N) is as in Remark \II.2.6i ) 

Remark II. 3. 6. The number D2{2g) is equal to the number of homotopically 
distinct ways to glue the boundary of a obtaining a compact orientable 

surface of (orientable) genus g. There is a similar interpretation for (3 = 
1: Di{s) is the number of homotopically distinct ways to obtain a compact 
surface of non- orientable genus s. 

One can extend this observation to Dp^^^s) (which corresponds to gluing 
k disks); this could be compared to the Harer-Zagier formulcB, cf. ITSl 6.5.6]. 



Part III 

General matrices 

In Part [Tll we have proved a version of Theorem II. 5. 31 for the special case 
(III.O.ip . In this part, we extend the considerations of Part Hi] to general 



matrices A that satisfy the assumptions of Theorem 11.1. 31 

Unfortunately, the nice formula (]II.1.4p is not valid for general matrices 
A. Instead, for every path p and matrix A, we shall define an expression 
7(p, A), such that for every 1 < u.,v < N and every ri > 0, 

(AT - 2)"/2f/„(A/(2v/iV^))„„ = 7(P, A) , 
where the sum is over all paths p of length n from u to v. 



^with a marked point on the boundary 
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III.l Extending Claim 111.1.2 



To any path pn = uqUi ■ ■ - Un we associate an expression 7(pn, A) and a sub- 
path G{pn) that satisfies (a),(b). Namely, set 

7(Po, -4) = 1, e{po) = po , 

and proceed as follows: 

1: If Un = Un-U 

1: if n > 2 and C(p„_2) = uq, set 

l{Pn, A) = 7(p„„l, A)Au„_,u„ + 7(Pn-2, A) , e{pn) = e(p„-l) ; 

2: else, set 

liPn^A) = -f{pn-uA)Au^_^u„ , e{Pn) = G{pn-l) ■ 

2: Else, 

1: if {un,Un-i) is the last edge edge of C{pn~i), 

1 if the previous step of type 2 was not of sub-type 2:1 (or did not 
exist), set 

7{Pn,A) = 7(p„_i, A) {\Au„_^uS - 1} 

= l{Pn-U A)Au„_,u„ - l{Pn-l, A) , 

where Pn-i is obtained from Pn-i by erasing the last edge in G{pn-i), 
and let C(p„) be C(p„_i) without the last edge; 

2 else, set 

liP^A) =-f{pn-uA)Au„_^u„ , 

and again, C{pn) is C(p„_i) without the last edge; 
2: else, set 

l{Pn,A) = 7(p„_i, A)v4„„_j„„ , 

and append to C(p„_i) in order to obtain G{pn)- 
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Claim III. 1.1. For any Hermitian N x N matrix A, 

(iV-2)"/2f/„(A/(2v/Ar^))„„„„ =5^7(p„,A) , (III.l.l) 

Pn 

where the sum is over all paths Pn = UqUi ■ ■ - Un- 

Sketch of proof. We check the claim for n = 0, 1 and proceed by induction. 
Suppose ( llll.l.ip holds up to n — 1. Then by ( 111. 1.31) 

(iV-2)"/2[/„(A/(2v/iV^))„„.„ 

= $^'7(Pn-i,v4)A,_,,„ - (iV-2)^"7(g„_2,A) , 



Un~l Pn-\ Qn-2 



where is over paths Pn-i of length n — 1 from uq to and is over 
paths g„_2 of length n — 2 from Uq to m„. 

In the first sum, set p„ = Pn~iUn', it is then equal to 



J2 liPn, A) - J2^'\{Pn-2, A) + Y^'^'\{pn-l, A) 



Pn 



(where the sum is split according to the construction above.) The map 
~: Pn ^ Pn-i is almost — 2 to 1. Namely, 



Thus 



A^-l, e(g„_2)=Mo 
N-2 . 



gn-2 e(g„_2)=M0 



But the last sum in (1III.1.2|) is exactly ^ ' . Thus finally 



(iV-2)"/2f/„(A/(2v/iV^)U„„ = 5^7(Pn,A) . 



Pn 

□ 
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By Claim lllLLll 



(iV-2r/^rf/„(A/(2v/iV^)) = ^7(p„,A) , (III.1.3) 

Pn 

where the sum is over all paths Pn satisfying (c) . Every path pn is decomposed 
into a non-backtracking part C(p„), the 'loopsou„_i = and the remainder 
3^1 (Pn) 5 which is a forest (= union of trees.) 

III. 2 Non-backtracking paths 

According to (]III.1.3|) . 

{N -2r/^EtTUn{A/{2VN^)) = J2^^iPn,A) , (III.2.1) 

Pn 

where the sum is over all paths satisfying (c) in which every edge is passed 
an even number of times. This expression is zero for odd n. In this section, 
we let P = 1 and focus on the sub-sum 

^1 (2^) = ^ ] '^AuoUiAuiU2 ' ' ' ^M2n-l«2n 

over paths p2n satisfying (a), (b), (c), (d^). 
Lemma III. 2.1. 

1. S?'^(2n) <nexp(C"n3/2/Ari/2); 

2. for n = o{^/N), Si'^(2n) = Si(2n)(l + o(l)). 

Here C > and the implicit constant in o(l) depend only on Cq from (A2). 
By (A3i), 

Si'^ > S}(2n) > Si(2n) 

(with Il\{2n), Ei(2n) as in Section Hi. ip . Therefore we only need to prove the 
upper bounds. For a constant C > 0, denote 

E}'"'^(2n) = #^(P2n) ^ 

P2n 

■^these are called 'loops' in the standard graph-theoretical terminology, not to be con- 
fused with loops in the sense of Section 111.21 
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where now the sum is over matched paths p2n, and ^E{p2n) is the number 
of distinct edges in p2n- By (A2), 

E?'^(2n) < S}™'^(2n) 

for some constant C depending only on Cq (note that a factor {k/C'Y from 
(A2) is absorbed in the number of matchings.) Thus we may restrict our 
attention to ZlJ™''"(2n). 

For any weighted diagram corresponding to a path n — ^E{p2n) < b, 
where b is the number of edges e with w{e) = —1. This allows us to follow 
the proof of Proposition III.2.5I 

Proof of Lemma \III.2.1\ As in the proof of Proposition IIL2.5t 

l<s<n 6>0 ^ / V / 

l<s<n ^ ^ 6>0 

l<s<n 

From this point proceed exactly as in the proof of Proposition III.2.5[ □ 



III. 3 Backtracking paths 

Now let us estimate the contribution of all the other paths; we still assume 
that (3 = 1. 

Let p2n be a path that gives non-zero contribution to fllll.l.Sp . We de- 



compose it into q2{n-m) = G{P2n), a forest f2m = 3^{P2n), and the 'loops'. 
Recall that q2(n-m) (which may degenerate to a single vertex) satisfies (a), 
(b), (c), (di). Also, 

every leaf of /2m appears somewhere else on p2n (at least once). (III.3.1) 

These statements follow from the expression for 7(p„,y4) in Section [111.11 

The paths for which /2m is empty correspond to the expression E^'^(2n) 
that we have studies in the previous section. Let us show that the contri- 
bution of the other paths is negligible. The basic idea is to show that the 
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contribution of paths with forests is neghgible with respect to the contribu- 
tion of non-backtracking paths, where each tree of the forest is replaced by 
the simplest non-backtracking piece (Figure [6l right). 

To start the computations, we need a new kind of diagrams (cf. Defini- 
tions [nTSl [nXU) 

Definition III. 3.1. A tree diagram (or shortly, t-diagram) is a rooted binary 
planar tree (that is, a binary rooted tree with fixed imbedding into the plane). 
A weighted t-diagram is a t-diagram together with a weight function w from 
the set of edges to { — 1, 0, 1, 2, ■ ■ ■ }. 

Similarly to Section III. 11 we can attach a weighted t-diagram to every 
tree in the forest /2m- 

Lemma III. 3. 2. The number D^{€) of different t-diagrams with £ leaves 
satisfies 

D\i) < 4^ . 

Proof. A t-diagram with i + 1 leaves is obtained by gluing a leaf to a t- 
diagram with i leaves. The new leaf can be glued to one of the edges on the 
branch connecting the root to the last-glued leaf (see Figure [7]). Denote by 
di the index of the latter edge on the branch, so that di = 1 if the edge is 
adjacent to a leaf. Then 

e-i 

Y^di<2{i-1) ■ 

i=l 

therefore the number of ways to choose the indices di is at most 
This proves the inequality. 

□ 

Let us bound the number of ways to construct a tree t2m'- The number 
of edges on a t-diagram with i leaves is 2£ — 1; hence the number of ways to 
place the weights is thus at most 
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Before 5th step 



After choosing d=2 



Figure 7: Adding a leaf to a tree 



(since ^ < m'.) 

According to (1III.3.1I) . every leaf should appear somewhere else on p2n- 



Thus we can assume there are ii < 1/2 pairs of coinciding leaves, and every 
one of the remaining I — 2ii coincides with some edge that is not a leaf. The 
number of ways to choose the vertices on the leaves is therefore at most 

^ [ i\ e-2iA^j^2i.^fiC^Nr^' , ^ = mod 2 

oJ^i/2 ^2^1/ ~ \nN-\Cmy/^ , i=l mod 2 

(since n < A^^/^.) 

The number of ways to choose the other vertices is at most A^"* Hence 
the number of ways to choose all the vertices is at most 



(III.3.2) 



' N"''{Ci/Ny/\ i = mod 2 
nN'^'-^Ci/Ny/^ , i=l mod 2 . 

Thus the total number of trees is bounded by 

AT-' I 4:'iC,m'/£Y'-\C£/N''Y/^ 

e=0 mod 2 

+ "^'iCim'/if-^-iCi/Ny/^^ < C2N"'-\m'^ + n) 

e=l mod 2 



A more careful computation (in the spirit of Section [Ill.2p shows that the 



last estimate remains valid if we count every tree with a weight, depending 
on higher moments of A, and take the 'loops' into account. 
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Therefore the number of t-tuples of trees with m edges is at most 



J2 UC^N^^-^m'^ + n) , 

mi-\ \-mt=m j=l 

and the total contribution of paths p2n with t trees and m edges on these 
trees is at most 



^ miH \-mt=m j=l 



It is not hard to check that the sum of these terms over m, t > is neghgible 
with respect to Si(2?t,), for n = o(\/iV). 

This proves the claim at the beginning of this section; namely, item 3. of 
Proposition III.2.5] is valid in the generality of Theorem 11. 1.31 (for (3 = 1). A 
similar argument allows to extend item 2. of Proposition III.2.51 

Proof of Theorem I/. 5. 31 We shall only sketch the argument for k = 1; the 
extension is straightforward. For /5 = 1, we have just proved the stronger 
conclusion of Proposition III.2.5I 

For (3 = 2, the error terms for A are dominated by those for A, where 

A 1-14 I 

and the random signs are independent above the diagonal; A satisfies the 
assumptions of the theorem with P = 1. □ 



Part IV 

Sample covariance matrices 



IV. 1 Proof of Theorem II. 5. 4 

As in the Hermitian case, we start with the special cases 
P = 1, Xuv = ±1 with prob. 1/2 , 



, 1^ (IV.1.1) 
/3 = 2, Xuv ~ unif(5^) . ^ ^ 
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Define a sequence of polynomials Q„ = Qn,M,N as follows: 
Qq{x) = 1 , Qi{x) = X - N , 

g„(x) = (x - (M + iV - 2))g„_i(x) - (M - l)(iV - l)Qn-2{x) 
Lemma IV. 1.1. 

x-{M + N -2) 



Q„(x) = ((M-l)(iV-l))"/2 f/, 



2v/(M-l)(iV-l)^ 
M-2 / x-(M + A^-2) \ -| 

Similarly to Lemma lll.l.lt Lemma IIV.I.II can be easily proved by induc- 
tion. 

Claim IV. 1.2. Let X be an M x N matrix with entries on the unit circle, 
B = XX*. Then 

Qn{B)ugU^ ^ ^ ^M0t)0^MHJ0^Hltil^H2fl ■ ■ ' -^Un-lVn-l-^UnVn-l ) 

Pn 

where the sum is over paths Pn = UqVqUiVi ■ ■ -Vn-iUn in the complete bipartite 
graph Km,n ( that is, 1 < Uj < M , 1 < Vj < N), such that 

(b) Uj_i ^ Uj for 1 < j < n and Vj_i ^ Vj for 1 < j < n — 1. 

Proof. For n = 0,1, the verification is straightforward. Induction step: 

{Qn{B){B - Ni))^^^^^^ = (g„(5)gi(5))„„.„^, 

Pn+1 

where the sum is over paths Pn+i that satisfy (b), except perhaps for the 
inequahty 7^ Vn- Now separate the paths in 3 categories: 7^ f„; 

Vn-l = Vn, Un-1 7^ Un+l, Vn-1 = f„, = Un+l, which yield Qn+l (5)no«„+i , 

(M - 2)g„(5)„„„„^,, and (M - l)(iV - l)g„_i(5),o,„^,, respectively. □ 

Thus Etr Qn{B) is equal to the number of paths p„ = UqVqUiVi ■ ■ ■ Vn-iUn 
that satisfy (b) and also (c), (d^: 
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(c) Un = Uq] 

{d\) for any u, v, 

#{j I (Uj, Vj) = (m, v)} = #{j I {uj, Vj) = {v, u)} mod 2 ; 

(dg) for any u, v, 

#{i I = {u,v)} = #{j I {uj,Vj) = {v,u)} . 

Denote the number of such paths by E^(r;,). The remainder of this section 
is devoted to the following analogue of Proposition III. 2. 51 

Proposition IV. 1.3. 

1. ±}^{n) < Cn{MNY/'^e^^{Crv^/'^/M^/'^); 

2. zfn<^M^/^, 

Sj(n) = (l + o(l))(MiVr/^ 

{ (1 + v/M/iV) Mn, ^ jv-i/2)-2) 

+ (-l)"(l - ^/mJn) <Pp{n, - iV-i/2)-2) 

where 0^ is as in Remark \n.2.(A 

As in Section III.lt we consider matched paths; every (matched) path 
corresponds to a diagram. Let us study the number of paths corresponding 
to a given diagram. To place the weights, we need the following elementary 
lemma. 

Lemma IV. 1.4. The number of ways to represent a non-negative integer m 
as 

m = m[-\ \-m'^ + m'l-\ h ml (IV. 1.2) 

with m'j = 1 mod 2 and m'- = mod 2 is given by 



6{m, a, b) 



0, a ^ m mod 2 

( \+b-i ) ' « = 2 
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Proof. The first part is obvious. The second part follows from the equivalence 
between f llV.1.21) and 

m — a m'l — 1 m' — 1 m'/ m'! 

= — \ \ \ ^ \ . 

2 2 2 2 2 

□ 

Now consider a diagram corresponding to a certain s > 1 (in the sense 
of Claims III.2.2IIII.2.1[ ) Let Pn be a path corresponding to this diagram. 
Denote by V+ {VJ) the number of vertices of the (2"^^) type (that is, Uj 
or t>j, respectively). Denote by V+ (F_ = 2s — V^) the number of vertices of 
the 1*^* (2°^^) type on the diagram. 

Lemma IV. 1.5. In the notation above, 

n + 2 - \/+ n-2s^V+ 
t/-i- — 1 V— — . 



Proof. Let 

+ 1, w is of the first type 



(j{w) 
Consider the sum 



■1, w is of the second type . 



s= K^) + ^(^')), 

where the sum is over all the edges in the diagram. The root uq is of the first 
type; every other vertex w is counted with coefficient 3cr(w). Hence 

S=l + 3(V+ - 1) - 3(V_ - 1) = 1 + 3(2V+ - 2s - 1) . 

Therefore 

V^-V^ = V+-V---S = -V+ + s + 1 . 

2 

On the other hand, 

V^ + V^ = W + ^ w(e) = n + 1 - s . 
The statement follows. □ 
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The collection of paths corresponding to a given diagram and given choice 
of types of the vertices w & V is non-empty iS = n mod 2. Therefore 
the number of ways to choose the vertices on the diagram is at most 



Together with Lemma IIV.1.41 this proves the first statement of Proposi- 
tion IIV.1.3[ To prove the second part, we argue exactly as in the proof of 
Proposition III.2.51 item 3. 

Remark IV. 1.6. The extension to higher k > 1 is straightforward; instead 
of item 2., we obtain 



To prove Theorem II.5.4t it remains to extend these considerations to 
general matrices B. This is done along the lines of Part IIIII (actually, the 
argument is slightly simpler, since there can be no 'loops'.) 



V+=n mod 2 






(l + o(l)) (-(1 - VM/Ar))S'e^"» (1 + ^/M/N)^^^ 



IC{l,-,k} 
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Part V 

Extensions and further 
applications 

V.l Some extensions 

In this section, we outline the proofs of some results that are more or less 
straightforward extensions of what we have already considered. 

Matrices with quaternion entries. In addition to /9 = 1,2, one can also 
consider /3 = 4. Then Theorems 011 [1X21 HXSl \LT2\ OiSj OH remain 
valid, after the following modifications. 

Instead of complex-valued random variables, the entries of the matrices 
will be random (real) quaternions r = r^^^ + ir^^^ -|- j>(2) _|_ J^f(3)_ xhe assump- 
tions (Al),(A2) still make sense, with 

|r| = iy(r(0))2 _^ (^(l))2 _^ (^(2))2 _^ (^(3))2 . 

the analogue of (A3i), (A32) will be 
(A34) Er«r(^) = Sij/A, < z,j < 3. 



If y4 is an X (real-) quaternionic matrix, and Auv = A^u {A is 
"self-dual Hermitian"), one can consider the eigenvalues of A, which are real 
numbers Xi < ■ ■ ■ < Xn (see [IS]). To define the Airy point process 2ti4 and 
the distribution TW4, let 

r+00 

IK{x,x') = - K{x",x')dx' , 

J X 

K^{x,x)- 2 iK(2x,2x') K{2x,2x') J " 

Then the density of the correlation measures pk of 2ti4 off the diagonals is 
given by 

axi ■ ■ ■ axk V ^ ^ i<jj<fc 
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and the Tracy-Widom distribution TW4,- by its cumulative distribution 
function 



Fi(i) 

The Tracy-Widom theorem (formulated in Section II.4P holds also for /3 = 4; 
see [25] for more details. 

The role of Examples II.3.2III.3.1I is played by the Gaussian Symplectic En- 
semble (GSE), 

^(^) f iV(0, 1/4) + ziV(0, 1/4) + jiV(0, 1/4) + kN{0, 1/4) , u v 
™ ^ \iV(0,l/2), u = v, 

and the quaternionic Wishart ensemble, 

X W ~ iV(0, 1/4) + «iV(0, 1/4) + jN{0, 1/4) + kNiO, 1/4) , 

respectively. Theorems lI.2.4III.2.2III.5.4l are known to be true in this particular 
case (of course, with /3 = 4). 

To prove the theorems for matrices with arbitrary entries, we extend The- 
orems 11.5.31 and 11.5.41 Note that, for any self-dual Hermitian quaternionic 
matrix A with eigenvalues Ai < ■ ■ ■ < Aat and a (real) polynomial P, P{A) 
is well-defined and 

N 

trP(A) = 5^P(A,) . 

i=l 

Consider first the matrices 

^ |unif(S'), u^., ^^^ ^ 

I U, u = V , 

which are the quaternion analogue of (lll.O.ip . The relation (111.1.4^ remains 
valid for these matrices. Multiplication of quaternions is non-commutative, 
hence the expectation of a product of random quaternions depends on the 
order of terms in the product; see e.g. Bryc and Pierce [5] for a detailed 
analysis. 

However, one can easily show the following: 
Lemma V.1.1. For a path p2n = UqUiU2 ■ ■ ■U2n~iUo, denote 
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1. If A is as in KV.l.l 



), HP2n) \ < 1 for any path p2, 



2. Also, e(p2n) is real. 

3. Letp2n (irid P2^, be two paths that satisfy (a), (b), (c), (di) and have the 
same diagram. If the corresponding weights are non-negative, z{p2n) = 

4. The statements 1,2,3 are also true for k-paths, for any k > 1. 

Proof. The first part follows from the multiplicativity of the absolute value 
and Jensen's inequality. To prove the second part, note that, for any ( G S^, 

C ^^{P2n)C = ( ^ {^AuguiAuiu2 ' ' ' ^U2„-iU2„) ( 

(since A 

uv ~ C ^AuvC-) Similarly, the third part is true since rr' ~ unif^jS*"^) 
for independent r, r' ~ unif(5''^). The same arguments are valid for the fourth 
part. 

□ 

Applying the lemma and proceeding as in Part [TTl we see that, for -C 
A^-*^/^, the asymptotics of expectations E]^trP„.(A/(2A/A^ — 2)) is given by 
a series in n^J"^ /N^/"^, the coefficients of which are sums over diagrams (one 
may compute these coefficients recursively, but we shall not need this.) 

To extend these results to general matrices, we proceed as in Part IIIII 
First, observe that if p2n is a path without 'loops' on which every edge appears 
exactly twice, then 

for any (Hermitian self-dual) random matrix A, the elements of which satisfy 
(A34). Hence the contribution of paths that satisfy (a), (b), (c), (di) is 
asymptotically the same as in the uniform case ( IV.l.lll . The contribution of 
other paths is dominated by the corresponding term for jS = 1, and hence is 
negligible. 

These considerations show that Theorem II. 5. 31 is valid also for (3 = 4. 
Similarly, Theorem II.5.4I can be extended. Theorems II.l.ll II.1.21 and II. 1.31 
follow by the arguments of Section II. 5| which remain valid without any mod- 
ification. 
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Matrices with unequal real and imaginary parts. In [T^ Chapter 14], 
Mehta considers the following ensemble of random Hermitian matrices: 



liV(0,l/(l + a2))+,iV(0,aV(l + a2)) , u^v 



(where of course the entries above the diagonal are independent.) Taking 
a = 0, we recover GOE, a = 1 yields GUE, whereas a = oo yields what is 
called the Anti- Symmetric Gaussian Orthogonal Ensemble (AGOE, cf. p!5| 
Chapter 13].) 

It may be natural to consider the following generalisation: again, A will 
be a random Hermitian matrix as in Theorem II. 1.31 with (A3/3) replaced with 



(A3°2) Er2 = i^. Erf = 1. 

Exactly as in the preceding proofs, one can show that, for any < a < 
+00 (that may depend on A^), the distribution of the largest eigenvalue of 
A is asymptotically the same as in the Gaussian case flV.1.2p . Again, the 
proof passes through an analogue of Theorem II. 5. 31 which yields a diagram 
expansion. 

Corollary V.1.2. Let A he a random matrix as in Theorem \I.1.3\. with (ASjs) 
replaced with (A3I2), O'^'^d let Xn be its largest eigenvalue. 

1. //0<a<A^-l/^ 

N^^^Xn - 2N^/^ ^ TWi ; 

2. IfN~^l^ < a < +00, 

N'^'^Xn - 2A^2/^ ^ TW2 . 

That is, the crossover from GOE asymptotics to GUE asymptotics occurs 
at a ^ A^~^/^ This is of course coherent with [151 (14.1.31)], which asserts 
that the crossover should occur for 



1 + 



Average spacing between eigenvalues 



(the average spacing at the edge is of order N^^^, cf. Theorem II. 2. 41 ) Also, 
the GUE asymptotics for the largest eigenvalues is valid up to a = +00; this 
is coherent with the analysis in [13, 13.2.2]. 
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Sketch of proof. If a pathp2n = UqUiU2 ■ ■ ■U2n-iUo satisfies (a), (b), (c), (di), 



F4 A ... 4 — f 

\1 + 

where ni is the number of edges passed twice in the same direction. For n of 
order N^^^, the diagrams of the paths that contribute to the asymptotics are 
generated by the automaton of Section ULT] that stops after s = 0(1) steps. 
If the diagram is not of type /5 = 2, rii will be of order N^^^, and hence the 
contribution of p2n will be of order 



1 + 

This expression is 1 + o(l) for < a <^ N~^/^, and negligible for 



For a 1, the contribution of diagrams that are not of type (3 = 2 is 
negligible for a different reason. Namely, if there is at least one "loop" (in 
the sense of Section III.2P that is passed twice in the same direction, the 
contribution of paths with even and odd weights on this loop nearly cancel 
each other. 

The same applies to fc-paths and fc-diagrams. □ 

Forrester, Nagao and Honner ^ have studied the extreme eigenvalues of the 
Gaussian ensemble (IV.1.21) in the crossover regime a'^N^^^ t. In particular, 
they have computed the limiting correlation measures for the point processes 

Our argument shows that their results extend to general matrices A^^'' that 
satisfy the assumptions of Corollary IV.1.2[ 

Similar results can be proved for ensembles interpolating between (3 = 2 and 
/? = 4, and for sample covariance matrices interpolating between (3 = 1 and 
(3 = 2 and between (3 = 2 and (3 = 4:. 
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V.2 Deviation inequalities for extreme eigen- 
values 



Explicit upper bounds for the probability that the extreme eigenvalues devi- 
ate from their mean have various applications. The reader may refer to the 
lecture notes by Ledoux [TT] for an extensive discussion and references. The 
following estimates follow from Theorems II.5.3III.5.4[ 

Corollary V.2.1. 

1. For A as in Theorem \I.1.3[ 

P|||A|| > 2v^(l + < Cexp{-C-^Ne^/^) , 

where the constant C > may depend on Cq from (A2). 

2. For B as in Theorem \I.1.2\ 

(a) p|Am(5) > {VM + ^/Nf + eN'j < C exp{-C-Hie^/^) , 

(b) p|Ai(fi) < {y/M - y/Wy - eN^ < ^— ^=exp(-C"iMe3/2). 

In slightly less general form, the estimate 1. follows from the recent work 
of Aubrun [1] and Ledoux [IDlElIlS] • Estimates similar to 1. and 2. (a) can 
be probably also derived from bounds on traces of high moments, similar 
to those considered by Soshnikov and Peche [2n[22| [T6]. The estimate 2.(b) 
seems to be new. 

Proof. We shall only prove the first estimate (deducing it from Theorem ll.5.31 ) 
The estimates 2. (a), 2.(b) can be similarly deduced from Theorem II.5.4I 

For e < CN~'^^^, the atatements is trivial. For larger e, we have by the 
estimate 1. in the proof of Theorem II.2.4t 

C N 

Etr(A/(2v^))2- < ri^exp(C^mVAr2) . 
Now take m = ./-^N and apply Chebyshev's inequality. □ 
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We conclude with a short discussion of the fluctuations of Ai(-B) for M 
approaching N , and (two forms of) an open question. 

The inequality 2.(b) in Corollary IV. 2 . 1 1 shows that the order of the fluctu- 
ations of Ai(i?) is at most 0{N^^^^°^^^). On the other hand, for the Gaussian 
case -Binv, the fluctuations are of order 

O {{N - M + lY/^/N) , (V.2.1) 

which is strictly smaller when M = N — o{N). It is therefore natural to 
ask whether (]V.2.ip holds under the general assumptions of Theorem II. 1.21 
Recently, Rudelson and Vershynin [17] have proved this for N — M = 0{1); 
to the best of our knowledge, the intermediate case 1 <^ N — M <^ N is still 
open. 

One may also ask whether the assumption limsupM/A^ < 1 in Theo- 
rems II.1.1|I.2.2I can be relaxed to N — M oo (as is the case for Anv, cf. 
Borodin and Forrester [3]). A positive answer to this question would imply 
a positive answer to the previous one. 

Added in proof: The regime N — M = 0{1) ("hard edge") has been recently 
further studied by Tao and Vu [23], who have proved an universality result 
for Xi{B). 
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